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Abstract 

Using feature-based Tree Adjoining Grammar (TAG), this paper presents linguistically motivated 
analyses of constructions claimed to require multi-component adjunction. These feature-based TAG 
analyses permit parsing of these constructions using an existing unification-based Earley-style 
TAG parser, thus obviating the need for a multi-component TAG parser without sacrificing 
linguistic coverage for English. 
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1 Introduction 

It has been argued that the analysis of certain 
linguistic constructions requires an extension of 
the basic tree-adjoining grammar (TAG) formal- 
ism to include multi-component adjunction [4, 3]. 
The restricted version of multi-component adjunc- 
tion suggested for these constructions does not 
change the weak or strong generative capacity of 
the formalism. This paper demonstrates how these 
constructions can be handled using feature-based 
TAGs, thereby eliminating the need to construct 
a parser for TAGs with multi-component adjunc- 
tion. This would make it possible to parse such 
constructions with the current implementation of 
the feature-based TAG parser [7]. 

Our analysis first develops the alternative sug- 
gested by Kroch and Joshi (1987)[4] for handling 
extraposition with features and then extends the 
approach to the other cases in English that appear 
to require multi-component adjunction, such as ex- 
traction from PP adjuncts and extraction from in- 
direct questions. The feature based TAG analyses 
for these cases are as linguistically well-motivated 
as analyses that require multi-component adjunc- 
tion (e.g. [4]). 



2 Tree Adjoining Grammar 
(TAG) formalism 

The analysis in this paper is based on two exten- 
sions to the TAG formalism developed in Joshi, 
Levy, Takahashi (1975) [2]: feature structures [8] 
and multi-component adjunction [2, 3, 4]. The 
reader is referred to the references cited in this sec- 
tion for more detailed discussion of the formalism 
than will be provided in this paper. 

The primitive elements of the TAG formalism, 
ELEMENTARY TREES, are of two types: INITIAL 
TREES and AUXILIARY TREES. In a TAG gram- 
mar for natural language, INITIAL TREES are phrase 
structure trees of simple sentences containing no re- 
cursion, while recursive structures are represented 
by AUXILIARY TREES. Examples of initial and aux- 
iliary trees are shown in Figure 1. Nodes on the 
frontier of initial trees are marked as substitution 
sites by a (J,), while exactly one node on the fron- 
tier of an auxiliary tree whose label matches the 
label of the root of the tree, is marked as a foot 
node by a (*). The other nodes on the frontier of 
an auxiliary tree are marked as substitution sites. 
The elementary trees define the domain of locality 
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Figure 1: Elementary Trees for John walked to Philadelphia 
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over which constraints are specified. 

Elementary trees are combined by operations of 
SUBSTITUTION and ADJUNCTION. Substitution in- 
serts elementary trees into substitution nodes that 
appear on the frontier of other elementary trees. 
Adjunction grafts auxiliary trees into elementary 
trees at the node whose label is the same as root 
label of the auxiliary tree. As an example, the com- 
ponent trees shown in Figure 1 can be combined to 
form the sentence John walked to Philadelphia as 
follows: 

1. Figure 1(a) substitutes at the NPq node of Fig- 
ure 1(b). 

2. Figure 1(d) substitutes at the NP node of Fig- 
ure 1(c). 

3. The result of step (2) above adjoins to the VP 
node of the result of step (1). The final result 
is shown in Figure 1(e). 

The trees that can be adjoined at a node can 
be constrained by specifying one of the following 
adjoining constraints at that node. 

• Selective Adjoining (SA): Only a specified sub- 
set of all the auxiliary trees is adjoinable at the 
node. 

• Obligatory Adjoining (OA): At least one of all 
the auxiliary trees must be adjoined at the 
node. 



• Null Adjoining (NA): No auxiliary tree is ad- 
joinable at the node (node marked by NA). 

Feature structures can be added to the basic 
TAG formalism [8, 9] by associating a top and a 
bottom feature structure with each node. While 
the top feature structure at a node expresses the 
constraints specified by the structure above the 
node, the bottom feature structure expresses the 
constraints specified by the subtree associated with 
the node. 

When adjunction is performed at a node, the 
node "splits" and the features on the resultant tree 
are formed as shown in the schematic Figure 2 be- 
low. 

Creating a feature clash between top and bottom 
features of a node is equivalent to putting an OA 
constraint at that node. At the end of a derivation 
the top and bottom features of all nodes must unify. 
Potential feature unification failures due to a in- 
compatibility between the top and bottom feature 
values of a node can be averted by performing ad- 
junction at that node. Adjoining an auxiliary tree 
whose root has features compatible with the top of 
the node and whose foot has features compatible 
with the bottom of the node will separate the con- 
fiicting top and bottom features thereby preventing 
unification failure at the node. 

Multi-Component adjunction extends the basic 
formalism by having sets^ of trees rather than sin- 
gle trees. There are several ways of defining the 

"^We will continue to use the term 'set' in this paper for 
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Figure 2: Schemata for feature formation upon adjunction 
[t=top b=bottom r=root f=foot U=unification] 




(a) (b) (c) 

Figure 3: Component trees for A man fj- arrived [who knew Maryji 
in Kroch and Joshi's analysis 
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Figure 3(d): Derived tree for A man fj- arrived [who knew Maryji 
in Kroch and Joshi's analysis 



composition of tree-sets [10]. One of them is tree- 
local composition. Tree-local composition requires 
all members of a tree-set to adjoin or substitute 
on to the same elementary tree. The addition 
of tree-local multi-component adjunction does not 
increase the generative capacity of the formalism 
[4]. This paper demonstrates the use of feature- 
based TAGs to simulate tree-local multi-component 
adjunction using linguistically motivated features. 
With the feature-based analysis, the constructions 
which seemed to require multi-component adjunc- 
tion can be parsed with the current implementation 
of the feature-based TAG parser. 

3 Extraposition 

Joshi and Kroch (1987) [4] propose a multi- 
component adjunction analysis for extraposition, 
which is illustrated by (1) below. 

(1) A man fj- arrived [who knew Mary],- 

Their analysis requires a two-member tree-set 
containing the extraposed constituent, and the 

convenience although the term 'sequence' is more accurate. 
In theory, the grouping in question could consist of two in- 
stances of the same tree, where both instances were required 
and could not viewed as a one member set. In practice this 
problematic instance does not seem to occur. 



empty category to which the extraposed con- 
stituent is coindexed. For example, the auxiliary 
tree-set would be as in Figure 3(b) and Figure 3(c). 
The members of the tree-set would adjoin to the 
initial tree in Figure 3(a) to form the derived tree 
shown in Figure 3(d). 

Our analysis for this type of construction sim- 
ulates multi-component adjunction using linguisti- 
cally motivated features. The feature-based TAG 
analysis requires three elementary trees, shown in 
Figure 4. 

The initial tree is the one needed for the simple 
sentence A man arrived. This tree is augmented 
with features displ_const and displ_const_index 
at the root node Sr and subject node NPq. The 
value of the displ_const feature at a node indi- 
cates if the node dominates a trace of a displaced 
constituent. The displ_const jndex feature iden- 
tifies the trace with the extraposed element. 

In the initial tree, the displ_const feature has 
no value, and is only coindexed between the top of 
NPo and the bottom of Sr nodes. The substitution 
of a man at NPq instantiates <displ_const = -> 
on the bottom of NPq, correctly representing the 
fact that the NP, a man, does not contain a trace 
of a displaced constituent. If the derivation stops 
at this stage the unification of the top and bottom 
features of each node will result in both NPq and Sr 
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Figure 4: Component trees for A man fj- arrived [who knew Maryji in our analysis 



having <displ_const = ->. However, in the case 
of extraposition, either the extraposed clause repre- 
sented by the tree in Figure 4(b) or the trace, rep- 
resented by the tree in Figure 4(c) will adjoin to the 
tree in Figure 4(a). Either of these two adjunctions 
alone will introduce <displ_const = +> into the 
initial tree, resulting in a feature unification fail- 
ure with <displ_const = -> that is present in the 
initial tree. The site of the unification failure is de- 
cided by the order of adjunction of these trees to 
the initial tree. If Figure 4(b) adjoins to Sr node 
then the unification failure occurs between the top 
and the bottom feature structures at the NPq node. 
If Figure 4(c) adjoins to NPq node then the unifica- 
tion failure occurs between the top and the bottom 
feature structures at the Sr . This correctly repre- 
sents the linguistic fact that with only the extra- 
posed clause or only the trace, the resultant tree 
is incomplete and the derivation should not be ac- 
cepted. If both the components adjoin, the resul- 



tant tree contains no feature unification confiicts 
and the derivation may be accepted. 

Feature unification failures at a node can be 
remedied by adjoining an auxiliary tree whose root 
and foot nodes have feature structures compatible 
with those at the site of feature unification fail- 
ure. In the case where the tree with the extra- 
posed clause adjoins on to the Sr node of the initial 
tree, the feature unification failure at the NPq node 
obligatorily forces the adjunction of the tree with 
the needed extraction site, shown in Figure 4(c), at 
the NPq node. On adjoining, the confiicting fea- 
ture values for displ_const feature are separated 
and are no longer required to unify, thus resolv- 
ing the unification clash. The displ_const_index 
feature passes the index between the empty cate- 
gory and the extraposed clause through the initial 
tree. Note that even though the indexing even- 
tually is between elements from different auxiliary 
trees, dependencies only need to be stated within 
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Figure 4(d): Derived tree for A man fj- arrived [who knew Maryji in our analysis 



elementary trees so locality is not violated. It must 
also be noted that the dispLconst feature serves to 
ensure that both the trees (the tree with the empty 
category and the tree with the extraposed clause) 
are adjoined collectively into the initial tree. 

The displ-const feature differs from the AD- 
JUNCT feature of HPSG [6, 5] in the following 
way. The displ_const feature represents the pres- 
ence of a displaced element in a given derivation 
and requires the introduction of the constituent 
to which the displaced element is related, whereas 
HPSG's ADJUNCT feature represents a list of 
potential adjuncts. 

Extraposition of a relative clause on an object as 
in (2) can be handled in the same way as extrapo- 
sition of relative clauses on subjects. For relative 
clauses on objects, the auxiliary tree containing the 
extraposed clause adjoins to the object NP instead 
of the subject NP. 

(2) John gave everyone fj- a hard time [who knew 
Mary]i 

In contrast to adjuncts, which must be intro- 
duced by adjunction, place-holders for arguments 
are present in the initial trees as substitution nodes. 
Therefore extraposition out of argument position, 
such as shown in (3), requires that the tree with the 
trace be substituted rather than adjoined as in the 
cases previously discussed. But for this difference. 



the analysis is similar to the analysis for adjunct 
extraposition. 

(3) I told John fj- yesterday [that I wanted 
pizza]i. (= Kroch and Joshi (1987) [4] (50 a)). 

4 Extraction from PP ad- 
juncts 

Extraction from PP adjuncts has been largely ig- 
nored in the literature because it has been thought 
to be categorically ungrammatical. However, there 
are examples such as (4) that are perfectly gram- 
matical. 

(4) Which gate did you leave from? 

Accounting for the variation in grammaticality of 
extraction from adjunct PP's is beyond the scope 
of this paper. Rather than discuss the considerable 
linguistic issues raised by these constructions, we 
assume that such PP adjunct extractions should 
be included in the grammar, and propose a TAG 
analysis. 

PP adjunct extractions are very similar to the ex- 
traposition discussed earlier. In both cases, items 
that are related to each other cannot be in the same 



6 



VP [displ_const_index : <1>] 
[dispLconst : +] 




[dispLconst : 



(a) 



dispLconst : -] 
dispLconst : <1> [ ] 
displ_const_index : <2> [ 




Sc [dispLconst ; -] 

rdispl_coiist_index : <1> 
[dispLconst : <2> 




P NP[index : <1> [ ]] NP[index : <1> [ ]] S* 

[ ] [ ] rdispLconst_index : <1>1 

[displ_const : + J 



NP VP 



Mary 



wliere 



displ_const : <1> 
displ_const_index : <2> 
displ_const : -] 




NP[mdex ; <1>] [dispLconst ; -] 

[ ] rdispl_coiist_mdex : <2> [ ]1 

[dispLconst : <3> [ ] J 



S/ [displ_const_index : <2>1 

[displ_const : <3> J 

[displ_const_index : <1>1 

[displ_const : <4> + J 




V Sf |displ_const_index : <1> [ 
displ_const : <2> [ ] 

[ ] 



did 



(c) 



did NP 



VPrdispl_const : <4> 

[di8pl_const_index ; <1>J 
[displ_const 




P NP[index ; <1> [ 



walls 



(d) 



sleep under 



(e) 



Figure 5: Component trees and the derived tree for Where did Mary walk to ? 



initial or auxiliary tree. For extraposition, the re- 
lated items were the empty category and the extra- 
posed clause; for adjunct PP extractions the related 
items are the complement of the 'stranded' prepo- 
sition and the extracted NP. The trees in Figure 5 
show the necessary component trees for deriving 
example (5) below. 

(5) Where,- did Mary walk to fj? 

The adjunction of Figure 5(a), containing the 
stranded preposition to at the VP node in Fig- 
ure 5(b) introduces displ_const = + at the VP 
node and creates a feature unification failure at the 
S node. This forces the adjunction of Figure 5(b), 



containing where in sentence initial position, at the 
S node. The analysis for adjunct PP extraction 
is the same as that for extraposition in using the 
displ_const feature to force adjunction of the sec- 
ond auxiliary tree and the displ_const jndex fea- 
ture to accomplish the required coindexation. 

5 Extraction from indirect 
questions 

Kroch (1987) [3] argues that extractions from indi- 
rect questions, as in (6), require multi-component 
adjunction. 
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Figure 6: Schemata of trees displaying feature passing constraints 



(6) (I knew) [which book],- the students would 
forget whoj £j wrote fj- 

Since which book and the empty category to 
which it needs to be indexed begin in sepa- 
rate elementary trees, Kroch uses multi-component 
adjunction in this case simply as a means for 
achieving the proper indexing. In our analy- 
sis of extraposition, we have already presented a 
mechanism for producing indexing between con- 
stituents that originate in different trees using the 
displ-const jndex feature. Clearly a similar use 
of displ-const jndex feature will work in the case 
of extraction from indirect questions to achieve the 
desired result. We omit the details for the sake of 
brevity. 

6 Constraints on tree struc- 
ture and feature passing 

The mechanism required for simulating multi- 
component adjunction using the displ_const and 
displ-const jndex features is very constrained 
and consistent across the phenomena we have ex- 
amined. The structure of the component trees 
and the direction and extent of feature pass- 
ing characterize the constraints. There are three 



types of trees involved: initial trees shown in Fig- 
ure 4(a) and auxiliary trees with and without e 
leaves shown in Figure 4(b) and Figure 4(c) respec- 
tively. Structures of initial trees are not constrained 
since all subcategorization possibilities must be al- 
lowed. However feature passing in initial trees is 
quite constrained. As can be seen in Figure 4(a), 
the smallest phrasal constituents of the tree have 
displ-const = - as a bottom feature and the root 
node has dispLconst = - as a top feature. The 
dispLconst and dispLconst jndex features are 
coindexed between the bottom of the root and the 
top of the smallest phrasal constituents. The ad- 
junction of either type of auxiliary tree instanti- 
ates values for the coindexed features that results 
in obligatory adjunction of the second auxiliary 
tree. Auxiliary trees with the e leaf consist of a 
root with two daughters: a foot node and a node 
which dominates e. The displ_const jndex fea- 
ture in the top feature structure of the root node 
is coindexed with the index feature of the daugh- 
ter node dominating e. The footnode of these trees 
has displ-const = + and the bottom of the root 
has dispLconst = -. The auxiliary trees that do 
not contain an e leaf have the displ_const_index 
value coindexed between the two daughters of the 
root node as shown in Figure 4(c). The footnode 
of these trees is always S and has the feature value 
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displ_const = +. These tree configurations and 
features insure that the e is always c-commanded 
by the constituent in the non-e tree with which it 
is indexed. 

7 Conclusion 

English constructions that have been argued to 
require multi-component adjunction can be han- 
dled in a linguistically well-motivated manner by 
feature-based TAG analysis. In the cases of ex- 
traposition and extraction from PP adjuncts, the 
feature-based TAG analysis essentially simulates 
tree-local multi-component adjunction. For extrac- 
tion from indirect questions the problem of index- 
ing can be handled just as well by a feature-based 
analysis as by multi-component adjunction. We 
have also used the technique discussed in this pa- 
per for handling subject-auxiliary inversion. The 
details of the implementation are discussed else- 
where [1]. Also not discussed in this paper is the 
feature-based TAG analysis for extraction from re- 
cursively embedded NPs [1] which is superior to 
the analysis using multi-component adjunction pro- 
posed by Kroch [3]. These constructions, with the 
feature-based TAG analysis, can be parsed using 
the currently implemented unification-based TAG 
parser. This demonstrates that the implementa- 
tional advantages of feature-based TAG can be en- 
joyed without any sacrifice in linguistic coverage. 
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